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Abstract. Previous derivations of the sum and product rules of probability theory relied on the 
algebraic properties of Boolean logic. Here they are derived within a more general framework 
based on lattice theory. The result is a new foundation of probability theory that encompasses and 
generalizes both the Cox and Kolmogorov formulations. In this picture probability is a bi-valuation 
defined on a lattice of statements that quantifies the degree to which one statement implies another. 
The sum rule is a constraint equation that ensures that valuations are assigned so as to not violate 
associativity of the lattice join and meet. The product rule is much more interesting in that there 
are actually two product rules: one is a constraint equation arises from associativity of the direct 
products of lattices, and the other a constraint equation derived from associativity of changes of 
context. The generality of this formalism enables one to derive the traditionally assumed condition 
of additivity in measure theory, as well introduce a general notion of product. To illustrate the 
generic utility of this novel lattice-theoretic foundation of measure, the sum and product rules are 
applied to number theory. Further application of these concepts to understand the foundation of 
quantum mechanics is described in a joint paper in this proceedings. 
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INTRODUCTION 

A lattice is an algebra. Where an algebra considers a set of elements along with a set 
of operations that takes one or more elements to another element, the lattice considers 
a set of elements along with a binary ordering relation that sets up a hierarchy among 
the elements. The algebraic perspective is operational, whereas the lattice perspective is 
structural. Both the operational and structural relationships among elements are useful. 

Posets, Lattices and Algebras 

Two elements of a set are ordered by comparing them according to a binary ordering 
relation, generically denoted < and read 'z^' included by\ One of the first examples that 
comes to mind may be the ordering of the integers according to the usual meaning of 
the symbol < 'is less than or equal to\ The ordering that results is called a chain (Fig. 
[T^). To illustrate the hierarchy, we simply draw element B above element A ii A <B 
and connect them with a line if there does not exist an element X in the set such that 
A<X<B. 

In some cases, elements of the set are incomparable to one another, as in the common 
example of comparing apples and oranges. Figure [IJ5 shows an antichain of card suits 
where the elements are placed side-by-side to indicate that no element includes any 
other. 




FIGURE 1. Three basic examples of posets. (a) The integers ordered by the usual < form a chain. The 
element 2 is drawn above 1 since 1 < 2, and they are connected by a line because 2 covers 1 in the sense 
that there is no integer x between 2 and 1 such that 1 < x < 2. (b) The four card suits are incomparable 
under a wide variety of card game rules and we draw them side-by-side to express this. This configuration 
is called an antichain. (c) The set of partitions of three elements a, b and c ordered by partition containment 
forms a more complex poset that exhibits both chain and antichain behavior. One chain consists of the 
elements a\b\c, a\bc, and abc since each successive partition contains the previous. The elements a\bc, 
b\ac, and c\ab form an antichain because not one of these three partitions contains another. 



More interesting examples involve both inclusion and incomparability, which is why 
these structures generally are called partially ordered sets, or posets for short. Figure [It 
illustrates the lattice that results from partitioning three objects. One could consider all 
three objects together abc, or each separately a\b\c. We could also partition the objects 
in these ways: a\bc, b\ac or c\ab. These partitions can be compared according to a 
relation that decides whether one partition includes another. The partition abc includes 
the partition a\b\c since it can be obtained by simply sub-dividing abc. However, the 
partitions c\ab and a\bc are incomparable since, for example, there is no way to sub- 
divide the partition c\abio obtain a\bc. 

Given a set of elements in a poset, their upper bound is the set of elements that contain 
them. For example, the upper bound of the partition c\ab in Fig. [ih is the set {abc}. 
Given a pair of elements x and y, the least element of their upper bound is called the join, 
denoted .x;Vy. The lower bound of a pair of elements is defined dually by considering all 
the elements that the pair of elements contain. The greatest element of the lower bound 
is called the meet, denoted x /\y. h lattice is a partially ordered set where each pair of 
elements has a unique meet and a unique join (Fig.[2l). Graphically, the join can be found 
by starting at both elements and following the lines upward until they first intersect. The 
meet is found similarly by moving downward. There often exist elements that are not 
formed from the join of any pair of elements. These elements are called join-irreducible 
elements. Meet-irreducible elements are defined similarly. For example, the partitions 
a\bc,b\ac ox c\ab cannot be formed by joining any other pair of partitions and therefore 
are join-irreducible. In this case, these elements are also meet-irreducible. 

We can choose to view the join and meet as algebraic operations that take any two 
lattice elements to a unique third lattice element. From this perspective, the lattice is an 
algebra. This gives us both a structural and operational perspective which are related by 
a set of equations called consistency relations 

^ ^ , xyy=y 

x<y A (1) 

— ■^ x/\y = x 
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FIGURE 2. The poset on the left is a simple lattice, which illustrates the join V and the meet A. The 
poset on the right is not a lattice since the pair of elements on the bottom do not have a unique least upper 
bound. Similarly, the pair of elements at the top do not have a unique greatest lower bound. 



Given a specific lattice, we find that the consistency relations result in a specific 
algebraic identity. For example, the integers ordered by the usual 'less than or equal 
to' leads to 

x<y ^ max(x,3^)=3^ 

whereas the positive integers ordered by 'divides' leads to 

x\y ^ 3^1=3' (3) 

' gcd(x,j) =x 

Sets ordered by the usual 'is a subset of leads to 

^ xUy = y 

xCy (4) 

— ■^ xHy = x 

SPACES 

Lattices enable us to describe our world and encode the order that we observe. However, 
a description of the world is not the same as a description of our state of knowledge about 
the world. Here we examine the different spaces that we can construct using lattices. 



Components and State Space 

Systems are often made of components, and these components may have a natural 
order. Consider the components of a bridge: the left side L, the right side R, and the 
span 5, which sits on top of the two sides. The relationship between the components of 
a bridge is illustrated in Fig. [3l 

The states of a bridge can be constructed from the components by taking the set of 
downsets of bridge components and ordering them according to set inclusion. A downset 
is a set of components, such that if an element is a member of the set, then all elements 
that it includes are also members of the set. The result is a lattice of possible bridge 
states where each state is a set of bridge components. Note that the downset construction 
prevents one from including sets of components such as {L, 5} from being included as a 
state, since the span can only be present if both the left and right sides are present. 




L R {L} {R} 

FIGURE 3. The poset on the left encodes the relationship among the components of a bridge: the left 
side L, the right side R, and the span S, which sits on top. By taking the set of downsets ordered by set 
inclusion, we obtain a lattice describing all the possible states of a bridge where each state is a set of 
bridge components. 



Not all State spaces are constructed from components. Instead the states may represent 
the most elementary description of the system. Often, the poset of states forms an 
antichain consisting of independent and mutually exclusive elements. 



Hypothesis Space 

The bridge may be in one of four possible states of construction. A given individual 
may not know precisely which state the bridge is in, but may have some information that 
rules out some states, but not others. This set of potential states defines what one can say 
about the state of the bridge. For this reason, I call a set of potential states a statement. 
In this sense, a statement describes a state of knowledge about the state of the bridge. 
Note that quantification will come later. 

The lattice of statements is generated by taking the powerset, which is the set of all 
possible subsets of the set of all states, and ordering them according to set inclusion. 
In this example, since states are sets of components, statements are sets of sets. Other 
examples have been given in previous works [I,'?]. Given that there are four possible 
states, there are 2^ = 16 statements including the null set, which is generically called the 
bottom and denoted ± since it resides at the bottom of the lattice. The bottom element 
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{{L},{R},{L,R},{L,R,S}} 
{{L},{R},{L,R}} {{L},{R},{L,R,S}} {{L},{L,R},{L,R,S}} {{R},{L,R},{L,R,S}} 



{{L},{R}} {{L},{L,R}} {{L},{L,R,S}} {{R},{L,R}} {{R}, {L,R,S}} {{L,R},{L,R,S}} 
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{L,R} {L,R,S} 



FIGURE 4. This figure depicts the lattice of all statements that can be made about the state of the 
bridge. A statement is defined as a set of potential system states. The ordering relation of set inclusion 
naturally encodes logical implication (deduction), such that a statement impUes all the statements above 
it. 



is often omitted from the diagram due to the fact that it represents the logical absurdity. 
These statements, illustrated in Fig. HI form the hypothesis space. The ordering relation 
of set inclusion naturally encodes logical implication, such that a statement implies all 
the statements above it. 

The statements along the bottom of the lattice represent states of knowledge where 
the state of the bridge is known with certainty. The statement at the top is the truism, 
generically called the top and denoted T, which represents the state of knowledge 
where one only knows that the bridge can be in one of four possible states. Intermediate 
statements represent intermediate states of knowledge. For example, you may know that 
the right side R will be constructed by either Worker A, who is a very hard worker, or 
Worker B who is very lazy. In this case your state of knowledge about the state of the 
bridge would probably be the intermediate state {{L}, {L,R,S}}, which says that either 
just the left side is up or the whole bridge is finished. 

Logical deduction is straightforward in this framework since a statement in the lat- 
tice implies (is included by) every statement above it with certainty. Logical induction 
works backwards. One would like to quantify the degree to which one's current state of 
knowledge implies a statement of greater certainty below it. Since statements do not im- 
ply statements below them, this requires a generalization of the algebra representing the 
ordering. In the next section we will generalize this algebra to a calculus by introducing 
quantification. The result is a measure, called probability, that quantifies the degree to 
which one statement implies another. 

Inquiry Space 

One can take this idea of states and statements further. By defining a question in terms 
of the statements that answer it [3], one can generate the lattice of questions by taking 
downsets of statements [4]. That is, if a given statement answers a question, then all the 
statements that imply that statement also answer the question. In the case of our bridge, 
the lattice of questions has 167 elements. 

Just as some statements imply other statements, some questions answer other ques- 
tions. Answering a given question in the lattice will guarantee that you have answered all 
the questions above it. One can also generalize this algebra to a calculus by introducing 
quantification. The result is the inquiry calculus, which is based on a measure, called 
relevance, that quantifies the degree to which one question answers another. 

For more details on the lattice of questions and inquiry, see the following references 
[0, 11,0,0], as well as the paper by Julian Center in this proceedings 

QUANTIFICATION 

An algebra can be extended to a calculus by defining functions that take lattice elements 
to real numbers. This enables one to quantify the relationships between the lattice 
elements. A valuation v is a function that takes a single lattice element x G L to a real 
number v{x) in a way that respects the partial order, so that v{x) < v{y) iff x <y. This 




FIGURE 5. The poset on the left is used to establish the additive nature of the valuation. The poset in 
the center is used to establish the sum rule for the lattice in general. The cartoon on the right illustrates 
the symmetry of the sum rule. The sum of the valuations of the elements at the top and bottom of the 
diamond equals the sum of the valuations of the elements on the right and left sides. These dashed lines 
conveniently form a plus sign reminding us of the sum rule. 



means that the lattice structure imposes constraints on the valuation assignments, which 
can be expressed as a set of constraint equations. 

The valuation assigned to element x can be defined with respect to a second lattice 
element y called the context. The result is a function called a bi-valuation •w{x \ y) — 
Vy{x), which takes two lattice elements x and y to a real number. Here a solidus is used 
as an argument separator so that one reads | as the degree to which y includes x. 

In the following sections, we consider three symmetries and identify the constraints 
that they impose on valuation and bi-valuation assignments. The first two symmetries 
result from the lattice structure and thus impose the same constraints on the valuation 
and bi-valuation assignments; whereas the last symmetry considered is specific to bi- 
valuations. 



Sum Rule 

We begin by considering a special case depicted in Fig. [5] (left) of two elements x and 
y with join x\/y and a null meet x/\y = 1. (not shown). The value we assign to the join 
x\/y, written u{x\/y), must be a function of the values we assign to both x and y, u{x) 
and u{y), since if there did not exist any functional relationship, then the valuation could 
not possibly reflect the underlying lattice structure. We write this functional relationship 
in terms of an unknown binary operator © 

u{xyy) =u{x)®u{y). (5) 

We now consider another case where we have three elements x, y, and z, such that their 
meets are again disjoint. The least upper bound of these three elements can be written in 
these two different ways: x\/(y\/ z) and {x\/y) Vz. The value we assign to this join can 
also be written in two different ways 

u{x) ® {u{y)®u{z)) = {u{x)®u{y))®u{z). (6) 

This is a functional equation for the operator © for which the general solution is given 
by Aczel [H 

f{u{xyy))=f{u{x))+f{u{y)), (7) 



where / is an arbitrary invertible function, so that many valuations are possible. We 
take advantage of this freedom to choose a valuation v{x) = f{u{x)) that simplifies this 
constraint 

v{xyy)=v{x)+v{y). (8) 

By letting x = ±, equation dS]) implies that v(±) = 0. 

Now that we have a constraint on the valuation for our simple example, we seek the 
general solution for the entire lattice. To derive the general case, we consider the lattice 
in Figure [5] (center) and note that the elements x/\y and z have a null meet, as do the 
elements x and z. Applying ([8]) to these two cases, we get 

v{y) = vixAy)+viz) (9) 
v{xyy) = v{x)+v{z) (10) 

Simple substitution results in the general constraint equation known as the sum rule 

v{xyy)=v{x)+v{y)-v{xAy). (11) 

In general for bi- valuations we have 

v{xyy 1 1) = v{x I t)+v{y \ t) -v{xAy \ t). (12) 

for any context t. Note that the sum rule is not focused solely on joins since it is 
symmetric with respect to interchange of joins and meets. 

At this point, we have derived additivity of the measure, which is considered to be an 
axiom of measure theory. This is significant in that associativity constrains us to have 
additive measures — there is no other option. The cartoon at the right of Fig. [5] illustrates 
the symmetry of the sum rule. The sum of the valuations of the elements at the top and 
bottom of the diamond equals the sum of the valuations of the elements on the right and 
left sides 

v{xyy)+v{xAy)=v{x)+v{y). (13) 



Lattice Products 

Given the linearity of the constraint imposed by associativity (fT3l) . the only remaining 
freedom is that of rescaling. This means that any further constraints must have a multi- 
plicative form. One can combine two lattices via the lattice product where elements are 
combined in as in a Cartesian product. That is, the product of a lattice X with a lattice 
Y will result in a lattice X x F with elements of the form {x,y), where x EX and y . 
The lattice product is associative, so that for three lattices X, Y , and Z, we have 

(Xxy) xZ = Xx (7 xZ) (14) 

with elements of the form {x,y,z). 

The valuation assigned to an element (x, y) clearly must be a function of the valuations 
assigned to x and y. However, the associative relation above imposes a constraint on the 
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FIGURE 6. The chains on the left illustrate the changes in context used to derive the chain rule. The 
diamond in the center illustrates that the degree to which x includes x/\y equals the degree to which x 
includes y. The lattice on the right is used to derive the general product rule describing context change. 
See text below for details. 



valuations assigned to the elements of the lattice product. This constraint ultimately 
results in the following relation 

v{{x,y))=v{x)v{y), (15) 
which is a product rule that applies when one combines two independent spaces. 

The chain rule 

We now focus on bi-valuations and explore changes in context. We begin with a 
special case and consider four ordered elements x <y < z < t. The relationship x < z 
can be divided into two relations, x<y and y < z. In the event that z is considered to be 
the context, this sub-division implies that the context can be considered in parts. Thus 
the bi-valuation we assign to x with respect to context z, w{x \ z), must be related to both 
the bi-valuation we assign to x with respect to context y, w{x \ y), and the bi-valuation 
we assign to y with respect to context z, w{y \ z). That is, there exists a binary operator 
© that relates the bi-valuations assigned to the two steps to the bi-valuation assigned to 
the one step 

w{x I z) = w{x I y) w{y \ z) . (16) 

By extending this to three steps (Fig. [6l left) and considering the bi-valuation w{x \ t) 
relating x and t, via intermediate contexts y and z, results in another associativity 
relationship 

{w{x I y) Qw{y \ z)) 0w(z 1 1) = w{x \ y) {w{y \ z) 0w(z 1 1)) (17) 

Using the associativity theorem again results in a constraint equation for non-negative 
bi-valuations involving changes in context [@]. We call this the chain rule 

w{x\z) =w{x\y)w{y\z) . (18) 



The product rule 

We now focus on extending these results to the more general case illustrated with the 
lattice on the right side of Fig. |6l To begin we focus on the small diamond defined by x, 
y,xyy, and x/\y. If we consider the context to be x, the sum rule gives 

w{x I x) + w{y I x) = w{x\/y \ x) + w{xAy \ x). (19) 

Since x <x and x < x\/ y, we have w{x \ x) = w{x\/y \x) = I, and the sum rule reduces 
to 

w{y \ x) = w{x Ay \ x). (20) 

This relationship is illustrated by the equivalence of the arrows in the diamond in the 
center of Fig. [6l This result will used several times in the derivation that follows. 

Consider the chain where the bi-valuation w{xAyAz\x) with context x is decomposed 
into two parts by introducing the intermediate context ;c A The chain rule gives 

w{xAyAz I x) = w{xAyAz \ xAy)'w{xAy \ x). (21) 

To simplify this relation, consider the diamond defined hy xAy Az,x Ay,y\/ z, zto obtain 

w{xAyAz\xAy)=w{z\xAy). (22) 

Similarly, consider the diamond defined hy x, x\/ y, y Az, and x Ay Az to obtain 

w{xAyAz\x)=w{yAz\x). (23) 

Substituting and ^ into ^ results in 

w{yAz\x)=w{z\xAy)w{y\x), (24) 

which is the general product rule for context change. 

THE VALUATION CALCULUS 

We have derived that associativity of the lattice join leads to the sum rule for valuations 

v{xyy)+v{xAy)=v{x)+v{y). (25) 

which is a key axiom of measure theory. Associativity of the lattice product imposes an 
additional constraint, which results in a product rule 

v{{x,y))=v{x)v{y). (26) 

Extending the concept of valuation to that of a context-dependent bi-valuation, we 
obtain a sum rule 

w{xyy 1 1) +w{xAy \ t) = w{x \ t) -fw(y 1 1) , (27) 



a product rule for combining spaces 



w{{x,y) I (t^Jy)) = w{x I t^)w{y \ ty) , (28) 
and a product rule for context change 

w{y /\z\x) =w{z\x/\y)w{y\x) . (29) 

This calculus of valuations not only derives the condition of additivity of measures, 
but also generalizes traditional measure theory in an important way by introducing the 
concept of context. In addition to the sum rule of measure theory, composition of spaces 
and changes in context introduce two distinct product rules. 

APPLICATIONS 

Since these results are valid for all lattices, we can apply them to a wide array of 
applications. Applying the valuation calculus to the lattice of statements results in 
probability theory where the bi-valuations are conditional probabilities. Bayes theorem 
can be seen to implement a particular change in context. These results can also be 
applied to the lattice of questions resulting in the inquiry calculus, which is an analogous 
calculus of a measure on questions called relevance. 

These results can also be applied to the lattice of experimental setups in quantum 
mechanics with a significant difference where pairs of real numbers are needed to 
quantify the lattice elements rather than scalars as in the examples considered here. 
Joint papers written with Philip Goyal and John Skilling in this proceedings ifioll and 
elsewhere show why quantum amplitudes behave as complex numbers obeying 
Feynman's rules using methods very similar to those described here. 

To reinforce the idea that these sum and product rules are not limited to the domain 
of probability and are instead applicable to any consistent valuation over all lattices, we 
will briefly explore application of the valuation calculus to an unlikely domain — number 
theory. 

Number Theory and Sum and Product Rules 

We consider the lattice of integers less than a greatest integer t ordered by division. A 
portion of the lattice is illustrated in Figure |71 The bottom element is unity. The primes 
are the atomic elements, and the primes and powers of primes are the join-irreducible 
elements, which provides some insight as to why they have a special status in number 
theory. The consistency relation (|3]) illustrates that the join and meet are the number 
theoretic operations least common multiple (1cm) and greatest common divisor (gcd), 
respectively. Since the operation 1cm distributes over gcd and vice versa, the lattice is 
a distributive lattice. This means that we have the freedom to assign valuations to the 
join-irreducible elements (primes and powers of primes) arbitrarily. 
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FIGURE 7. The lattice generated by ordering the integers according to whether one integer divides 
another. The bottom element is unity. The primes are the atomic elements, and the primes and powers of 
primes are the join-irreducible elements. For the purposes of quantification, it is assumed that the lattice 
is not infinite in extent so that there exists a greatest integer in the set. The join of two elements is their 
least common multiple, and the meet is their greatest common divisor 



An interesting valuation assignment is one where v{p) = logp for all p such that p is 
a prime or power of a prime. For two primes p and q, we have according to ([8]) 

v(lcm(;?,^)) = log;? -flog (30) 
Specifically, for p = 2 and = 3, we have that 

v(6) = log 2 + log 3 = log 6. (31) 
In general, the sum rule holds. For example, 

v(12) =log4 + log6-log2 = logl2, (32) 

since lcm(4,6) = 12 and gcd(4,6) = 2. 

The chain rule and product rule hold as well. Let the greatest integer in the set be 
denoted by t, and assign w{p \ t) = logp for all p such that p is a prime or power of a 
prime. It is straightforward to show that d{n \ 1) = 1 so that the degree to which unity 
divides any integer n is 1. Similarly, d{n \ n) = \ and d{n | m) = 1 for all m<n. 

We can use the product rule to write 

d{m/\n I t) = d{m \ t)d{n \ mAt) (33) 
= d{n\t)d{m\nAt) (34) 

and equate the two right-hand sides as in the derivation of Bayes' Theorem to obtain 

, , d(m\t)d(n\mAt) 

d{m\nAt) = ^ — ' ' \ ' '-. (35) 

d[n I t) 

In the case where we let m < n < ? we have 

d{m I n) = (36) 
logn 



which for m — 2 and n = A gives us the degree to which four divides two 

rf(2|4) = J^ = l/2. (37) 
log 4 

CONCLUSION 

In this paper we derive the valuation calculus. Associativity of the join gives rise to the 
sum rule, which is symmetric with respect to interchange of joins and meets. Associa- 
tivity of the lattice product results a product rule, which dictates how valuations are to 
be combined when taking lattice products. Associativity of changes of context result in 
a product rule for bi- valuations that dictate how valuations should be manipulated when 
changing context. These results are valid for all lattices — not just the Boolean algebra 
of probability theory. This new theory of quantification opens the door to a wide variety 
of novel applications, such as decision theory and concept lattices. 

With respect to probability theory the result of this work is a new foundation that en- 
compasses and generalizes both the Cox and Kolmogorov formulations. By introducing 
probability as a bi-valuation defined on a lattice of statements we can quantify the degree 
to which one statement implies another. This generalization from logical implication to 
degrees of implication not only mirrors Cox's notion of plausibility as a degree of belief, 
but includes it. The main difference is that Cox's formulation is based on a set of desider- 
ata derived from his particular notion of plausibility; whereas here the symmetries of 
lattices in general form the basis of the theory and the meaning of the derived measure is 
inherited from the ordering relation, which in this case is implication. The fact that these 
lattices are derived from sets means that this work encompasses Kolmogorov's formu- 
lation of probability theory as a measure on sets. However, mathematically this theory 
improves on Kolmogorov's foundation by deriving, rather than assuming, summation. 
Furthermore, this foundation further extends Kolmogorov's measure-theoretic founda- 
tion by introducing the concept of context. This leads directly to probability necessarily 
being conditional, and Bayes' Theorem follows as a direct result of the product rule by 
which it implements change in context. 

By better understanding the foundation, we expect to be able to more readily extend 
our theory of inference into greater domains. This will include a better understanding 
of maximum entropy and perhaps lead to entropic inference where both data and con- 
straints are readily combined to aid our inferences. In another direction we consider 
inference in quantum mechanics, and our first steps in this direction can be found in a 
joint paper in this volume lilOll and elsewhere jull . 
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